This Raw Listing contains the General
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING ENTERED 

3 (1) General Information: 
4 

5 (i) APPLICANT: KLEIN, Michel H 

6 DU, Run-Pan

7 EWASYSHYN, Mary E 
8 

9 (ii) TITLE OF INVENTION: INFECTION DETECTION METHOD USING 

10 CHIMERIC PROTEIN 
11 

12 (iii) NUMBER OF SEQUENCES: 38 
13 

14 (iv) CORRESPONDENCE ADDRESS:

15 (A) ADDRESSEE: Sim & McBurney 

16 (B) STREET: 6th Floor, 330 University Avenue 

17 (C) CITY: Toronto 

18 (D) STATE: Ontario 

19 (E) COUNTRY: Canada 

20 (F) ZIP: M5G 1R7
21 

22 (v) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Floppy disk 

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

26 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 
27 

28 (vi) CURRENT APPLICATION DATA: 

29 (A) APPLICATION NUMBER:

30 (B) FILING DATE: 

31 (C) CLASSIFICATION: 
32 

33 (vii) PRIOR APPLICATION DATA: 

34 (A) APPLICATION NUMBER: US 08/467,961 

35 (B) FILING DATE: 06-JUN-1995 
36 

37 (vii) PRIOR APPLICATION DATA:

38 (A) APPLICATION NUMBER: US 08/001,554 

39 (B) FILING DATE: 06-JAN-1993 
40 

41 (vii) PRIOR APPLICATION DATA: 

42 (A) APPLICATION NUMBER: GB 9200117.1 

43 (B) FILING DATE: 06-JAN-1992 
44 

45 (viii) ATTORNEY/AGENT INFORMATION: 

46 (A) NAME: STEWART, Michael I 
47 (B) REGISTRATION NUMBER: 24,973

48 (C) REFERENCE/DOCKET NUMBER: 1038-1000 MIS:jb
49 

50 (ix) TELECOMMUNICATION INFORMATION: 

51 (A) TELEPHONE: (416) 595-1155 

52 (B) TELEFAX: (416) 595-1163 
53 

54 

55 (2) INFORMATION FOR SEQ ID NO: 1:
56 

57 (i) SEQUENCE CHARACTERISTICS: 

58 (A) LENGTH: 1844 base pairs 

59 (B) TYPE: nucleic acid 

60 (C) STRANDEDNESS: single

61 (D) TOPOLOGY: linear 
62 

63 (ii) MOLECULE TYPE: DNA (genomic) 

64 

65 

66 

67 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

68 

69 AAGTCAATAC CAACAACTAT TAGCAGTCAT ACGTGCAAGA ACAAGAAAGA AGAGATTCAA 60 
70 

71 AAAGCTAAAT AAGAGAAATC AAAACAAAAG GTATAGAACA CCCGAACAAC AAAATCAAAA 120 
72 

73 CATCCAATCC ATTTTAAACA AAAATTCCAA AAGAGACCGG CAACACAACA AGCACCAAAC 180 
74 

75 ACAATGCCAA CTTTAATACT GCTAATTATT ACAACAATGA TTATGGCATC TTCCTGCCAA 240 
76 

77 ATAGATATCA CAAAACTACA GCATGTAGGT GTATTGGTCA ACAGTCCCAA AGGGATGAAG 300

78 

79 ATATCACAAA ACTTCGAAAC AAGATATCTA ATTTTGAGCC TCATACCAAA AATAGAAGAC 360
80 

81 TCTAACTCTT GTGGTGACCA ACAGATCAAA CAATACAAGA GGTTATTGGA TAGACTGATC 420 
82 

83 ATCCCTCTAT ATGATGGATT AAGATTACAG AAAGATGTGA TAGTAACCAA TCAAGAATCC 480 
84 

85 AATGAAAACA CTGATCCCAG AACAAGACGA TCCTTTGGAG GGGTAATTGG AACCATTGCT 540
86 

87 CTGGGAGTAG CAACCTCAGC ACAAATTACA GCGGCAGTTG CTCTGGTTGA AGCCAAGCAG 600 
88 

89 GCAAAATCAG ACATCGAAAA ACTCAAAGAA GCAATCAGGG ACACAAACAA AGCAGTGCAG 660 
90 

91 TCAGTTCAGA GCTCTATAGG AAATTTAATA GTAGCAATTA AATCAGTCCA AGATTATGTC 720 
92 

93 AACAACGAAA TGGTGCCATC GATTGCTAGA CTAGGTTGTG AAGCAGCAGG ACTTCAATTA 780 
94 

95 GGAATTGCAT TAACACAGCA TTACTCAGAA TTAACAAACA TATTTGGTGA TAACATAGGA 840 
96 

97 TCGTTACAAG AAAAAGGAAT AAAATTACAA GGTATAGCAT CATTATACCG CACAAATATC 900 
98 

99 ACAGAAATAT TCACAACATC AACAGTTGAT AAATATGATA TCTATGATCT ATTATTTACA 960 
100

101 GAATCAATAA AGGTGAGAGT TATAGATGTT GATTTGAATG ATTACTCAAT CACCCTCCAA 1020 
102 

103 GTCAGACTCC CTTTATTAAC TAGGCTGCTG AACACTCAGA TCTACAAAGT AGATTCCATA 1080 
104 

105 TCATATAATA TCCAAAACAG AGAATGGTAT ATCCCTCTTC CCAGCCATAT CATGACGAAA 1140 
106 

107 GGGGCATTTC TAGGTGGAGC AGATGTCAAG GAATGTATAG AAGCATTCAG CAGTTATATA 1200 
108 

109 TGCCCTTCTG ATCCAGGATT TGTACTAAAC CATGAAATGG AGAGCTGCTT ATCAGGAAAC 1260 
110 

111 ATATCCCAAT GTCCAAGAAC CACGGTCACA TCAGACATTG TTCCAAGATA TGCATTTGTC 1320 
112 

113 AATGGAGGAG TGGTTGCAAA CTGTATAACA ACCACCTGTA CATGCAACGG AATCGACAAT 1380 
114 

115 AGAATCAATC AACCACCTGA TCAAGGAGTA AAAATTATAA CACATAAAGA ATGTAATACA 1440 
116 

117 ATAGGTATCA ACGGAATGCT GTTCAATACA AATAAAGAAG GAACTCTTGC ATTCTACACA 1500 
118 

119 CCAAATGATA TAACACTAAA TAATTCTGTT GCACTTGATC CAATTGACAT ATCAATCGAG 1560 
120 

121 CTTAACAAAG CCAAATCAGA TCTAGAAGAA TCAAAAGAAT GGATAAGAAG GTCAAATCAA 1620
122 

123 AAACTAGATT CTATTGGAAA CTGGCATCAA TCTAGCACTA CAATCATAAT TATTTTAATA 1680 
124 

125 ATGATCATTA TATTGTTTAT AATTAATGTA ACGATAATTA CAATTGCAAT TAAGTATTAC 1740 
126 

127 AGAATTCAAA AGAGAAATCG AGTGGATCAA AATGACAAGC CATATGTACT AACAAACAAA 1800 
128 

129 TGACATATCT ATAGATCATT AGATATTAAA ATTATAAAAA ACTT 1844 
130 

131 (2) INFORMATION FOR SEQ ID NO: 2:
132 

133 (i) SEQUENCE CHARACTERISTICS: 

134 (A) LENGTH: 539 amino acids

135 (B) TYPE: amino acid 

136 (C) STRANDEDNESS: single

137 (D) TOPOLOGY: linear 
138 

139 
140 

141 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

142 

143 Met Pro Thr Leu Ile Leu Leu Ile Ile Thr Thr Met Ile Met Ala Ser

144 1 5 10 15
145 

146 Ser Cys Gln Ile Asp Ile Thr Lys Leu Gln His Val Gly Val Leu Val

147 20 25 30 
148 

149 Asn Ser Pro Lys Gly Met Lys Ile Ser Gln Asn Phe Glu Thr Arg Tyr

150 35 40 45 
151 

152 Leu Ile Leu Ser Leu Ile Pro Lys Ile Glu Asp Ser Asn Ser Cys Gly



PAGE: 4 RAW SEQUENCE LISTING DATE: 02/01/2000 

PATENT APPLICATION US/09/479,240 TIME: 23:28:20 

INPUT SET: S34590.raw 

153 50 55 60 

154 

155 Asp Gln Gln Ile Lys Gln Tyr Lys Arg Leu Leu Asp Arg Leu Ile Ile

156 65 70 75 80 
157 

158 Pro Leu Tyr Asp Gly Leu Arg Leu Gln Lys Asp Val Ile Val Thr Asn

159 85 90 95 
160 

161 Gln Glu Ser Asn Glu Asn Thr Asp Pro Arg Thr Arg Arg Ser Phe Gly

162 100 105 110 
163 

164 Gly Val Ile Gly Thr Ile Ala Leu Gly Val Ala Thr Ser Ala Gln Ile

165 115 120 125 
166 

167 Thr Ala Ala Val Ala Leu Val Glu Ala Lys Gln Ala Lys Ser Asp Ile

168 130 135 140 
169 

170 Glu Lys Leu Lys Glu Ala Ile Arg Asp Thr Asn Lys Ala Val Gln Ser

171 145 150 155 160 
172 

173 Val Gln Ser Ser Ile Gly Asn Leu Ile Val Ala Ile Lys Ser Val Gln

174 165 170 175 
175 

176 Asp Tyr Val Asn Asn Glu Ile Val Pro Ser Ile Ala Arg Leu Gly Cys

177 180 185 190 
178 

179 Glu Ala Ala Gly Leu Gln Leu Gly Ile Ala Leu Thr Gln His Tyr Ser

180 195 200 205 
181 

182 Glu Leu Thr Asn Ile Phe Gly Asp Asn Ile Gly Ser Leu Gln Glu Lys

183 210 215 220 
184 

185 Gly Ile Lys Leu Gln Gly Ile Ala Ser Leu Tyr Arg Thr Asn Ile Thr

186 225 230 235 240 
187 

188 Glu Ile Phe Thr Thr Ser Thr Val Asp Lys Tyr Asp Ile Tyr Asp Leu

189 245 250 255 
190 

191 Leu Phe Thr Glu Ser Ile Lys Val Arg Val Ile Asp Val Asp Leu Asn

192 260 265 270 
193 

194 Asp Tyr Ser Ile Thr Leu Gln Val Arg Leu Pro Leu Leu Thr Arg Leu

195 275 280 285 
196 

197 Leu Asn Thr Gln Ile Tyr Lys Val Asp Ser Ile Ser Tyr Asn Ile Gln

198 290 295 300 
199 

200 Asn Arg Glu Trp Tyr Ile Pro Leu Pro Ser His Ile Met Thr Lys Gly

201 305 310 315 320 
202 

203 Ala Phe Leu Gly Gly Ala Asp Val Lys Glu Cys Ile Glu Ala Phe Ser

204 325 330 335 
205 
206 Ser Tyr Ile Cys Pro Ser Asp Pro Gly Phe Val Leu Asn His Glu Met

207 340 345 350 
208 

209 Glu Ser Cys Leu Ser Gly Asn Ile Ser Gln Cys Pro Arg Thr Thr Val

210 355 360 365 
211 

212 Thr Ser Asp Ile Val Pro Arg Tyr Ala Phe Val Asn Gly Gly Val Val

213 370 375 380 
214 

215 Ala Asn Cys Ile Thr Thr Thr Cys Thr Cys Asn Gly Ile Asp Asn Arg

216 385 390 395 400 
217 

218 Ile Asn Gln Pro Pro Asp Gln Gly Val Lys Ile Ile Thr His Lys Glu

219 405 410 415 
220 

221 Cys Asn Thr Ile Gly Ile Asn Gly Met Leu Phe Asn Thr Asn Lys Glu

222 420 425 430
223 

224 Gly Thr Leu Ala Phe Tyr Thr Pro Asn Asp Ile Thr Leu Asn Asn Ser

225 435 440 445 
226 

227 Val Ala Leu Asp Pro Ile Asp Ile Ser Ile Glu Leu Asn Lys Ala Lys

228 450 455 460 
229 

230 Ser Asp Leu Glu Glu Ser Lys Glu Trp Ile Arg Arg Ser Asn Gln Lys

231 465 470 475 480 

232 

233 Leu Asp Ser Ile Gly Asn Trp His Gln Ser Ser Thr Thr Ile Ile Ile

234 485 490 495 

235 

236 Ile Leu Ile Met Ile Ile Ile Leu Phe Ile Ile Asn Val Thr Ile Ile

237 500 505 510 
238 

239 Thr Ile Ala Ile Lys Tyr Tyr Arg Ile Gln Lys Arg Asn Arg Val Asp

240 515 520 525 

241 

242 Gln Asn Asp Lys Pro Tyr Val Leu Thr Asn Lys

243 530 535
244 

245 

246 (2) INFORMATION FOR SEQ ID NO: 3: 
247 

248 (i) SEQUENCE CHARACTERISTICS: 

249 (A) LENGTH: 1833 base pairs 

250 (B) TYPE: nucleic acid 

251 (C) STRANDEDNESS: single

252 (D) TOPOLOGY: linear 
253 

254 
255 

256 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

257 

258 AGACAAATCC AAATTCGAGA TGGAATACTG GAAGCATACC AATCACGGAA AGGATGCTGG 60 